A comparative assessment of the performance of ensemble learning in customer churn prediction

Authors

  • Hossein Abbasimehr
  • Mostafa Setak
  • Mohammad Jafar Tarokh
Abstract

Customer churn is a major concern for firms in all industries. The aim of customer churn prediction is to detect customers with a high tendency to leave a company. Although many modeling techniques have been used in churn prediction, the performance of ensemble methods has not yet been thoroughly investigated. In this paper, we therefore perform a comparative assessment of four popular ensemble methods (Bagging, Boosting, Stacking, and Voting) built on four well-known base learners: C4.5 Decision Tree (DT), Artificial Neural Network (ANN), Support Vector Machine (SVM), and Repeated Incremental Pruning to Produce Error Reduction (RIPPER). We also investigate the effectiveness of two sampling techniques: oversampling, as a representative of basic sampling techniques, and the Synthetic Minority Over-sampling Technique (SMOTE), as a representative of advanced sampling techniques. Experimental results show that SMOTE does not increase predictive performance. The results also show that ensemble learning significantly improves on the individual base learners in terms of three performance indicators: AUC, sensitivity, and specificity. In our experiments, Boosting gave the best results of all the methods; in particular, Boosting with RIPPER and Boosting with C4.5 were the two best-performing configurations. These results indicate that ensemble methods are a strong candidate for churn prediction tasks.
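The three performance indicators named in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: the function names, the 0.5 decision threshold, and the toy labels and scores are all assumptions made for illustration. Churners are coded as 1, and AUC is computed by its pairwise-ranking interpretation (the probability that a randomly chosen churner receives a higher score than a randomly chosen non-churner).

```python
from typing import Sequence, Tuple


def sensitivity_specificity(y_true: Sequence[int],
                            y_pred: Sequence[int]) -> Tuple[float, float]:
    """Return (sensitivity, specificity) for binary labels, where 1 = churner."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # churners caught
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # churners missed
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # loyal customers kept
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false alarms
    return tp / (tp + fn), tn / (tn + fp)


def auc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the share of (churner, non-churner) pairs ranked correctly.

    Ties between a churner and a non-churner count as half a correct pair.
    """
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0
               for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: true labels and churn scores from some classifier.
y_true = [1, 0, 1, 0, 0, 1, 0, 0]
scores = [0.9, 0.2, 0.6, 0.4, 0.7, 0.3, 0.1, 0.5]

# Assumed decision threshold of 0.5 to turn scores into class predictions.
y_pred = [1 if s >= 0.5 else 0 for s in scores]

sens, spec = sensitivity_specificity(y_true, y_pred)
area = auc(y_true, scores)
print(f"sensitivity={sens:.3f} specificity={spec:.3f} AUC={area:.3f}")
```

Sensitivity and specificity depend on the chosen threshold, while AUC summarizes ranking quality across all thresholds, which is one reason churn studies often report all three together.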

Similar resources

A Comparative Study of Techniques to Predict Customer Churn in Telecommunication Industry

Today there is intense competition among companies in the industry. As a result, companies pay more attention to their customers than to their products, and they have become aware of the customer churn issue. When a customer ends their relationship with a company, that loss is known as customer churn. Various data mining approaches are used to pred...

Hierarchical Alpha-cut Fuzzy C-means, Fuzzy ARTMAP and Cox Regression Model for Customer Churn Prediction

As customers are the main asset of any organization, customer churn management is becoming a major task for organizations to retain their valuable customers. In the previous studies, the applicability and efficiency of hierarchical data mining techniques for churn prediction by combining two or more techniques have been proved to provide better performances than many single techniques over a nu...

A Machine Learning Ensemble Approach to Churn Prediction Developing and Comparing Local Explanation Models on Top of a Black-Box Classifier

Churn prediction methods are widely used in Customer Relationship Management and have proven to be valuable for retaining customers. To obtain a high predictive performance, recent studies rely on increasingly complex machine learning methods, such as ensemble or hybrid models. However, the more complex a model is, the more difficult it becomes to understand how decisions are actually made. Pre...

Negative Correlation Learning for Customer Churn Prediction: A Comparison Study

Recently, telecommunication companies have been paying more attention to the problem of identifying customer churn behavior. Service providers know well that attracting new customers is much more expensive than retaining existing ones. Therefore, adopting accurate models that are able to predict customer churn can effectively help in customer retention campaign...

An empirical evaluation of rotation-based ensemble classifiers for customer churn prediction

Several studies have demonstrated the superior performance of ensemble classification algorithms, whereby multiple member classifiers are combined into one aggregated and powerful classification model, over single models. In this paper, two rotation-based ensemble classifiers are proposed as modeling techniques for customer churn prediction. In Rotation Forests, feature extraction is applied to...

Journal:
  • Int. Arab J. Inf. Technol.

Volume 11

Publication date: 2014